A procedure for estimating gestural scores from natural speech

نویسندگان

Hosung Nam

Vikramjit Mitra

Mark K. Tiede

Elliot Saltzman

Louis Goldstein

Carol Y. Espy-Wilson

Mark Hasegawa-Johnson

چکیده

Speech can be represented as a constellation of constricting events, gestures, which are defined at distinct vocal tract sites, in the form of a gestural score. Gestures and their output trajectories, tract variables, which are available only in synthetic speech, have recently been shown to improve automatic speech recognition (ASR) performance paper we propose an iterative analysis-by-synthesis based time-warping architecture to obtain gestural scores for natural speech. Given an utterance, the Haskins Laboratories Task Dynamics and Application (TADA) model generate its prototype gestural score and the corresponding synthetic acoustic output. An optimal gestural score estimated through iterative time-warping processes such that the distance between original and TADA-synthesized speech is minimized. We compared the performance of our approach to that of a conventional dynamic time warping procedure using Log-Spectral and Itakura Distance measures. We also performed a word recognition experiment using the gestural annotations to show that the gestural scores are suitable for word recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A procedure for estimating gestural scores from speech acoustics.

Speech can be represented as a constellation of constricting vocal tract actions called gestures, whose temporal patterning with respect to one another is expressed in a gestural score. Current speech datasets do not come with gestural annotation and no formal gestural annotation procedure exists at present. This paper describes an iterative analysis-by-synthesis landmark-based time-warping arc...

متن کامل

Estimation of articulatory gesture patterns from speech acoustics

We investigated dynamic programming (DP) and statemodel (SM) approaches for estimating gestural scores from speech acoustics. We performed a word-identification task using the gestural pattern vector sequences estimated by each approach. For a set of 75 randomly chosen words, we obtained the best word-identification accuracy (66.67%) using the DP approach. This result implies that considerable ...

متن کامل

Control concepts for articulatory speech synthesis

We present two concepts for the generation of gestural scores to control an articulatory speech synthesizer. Gestural scores are the common input to the synthesizer and constitute an organized pattern of articulatory gestures. The first concept generates the gestures for an utteranceusing the phonetic transcriptions, phone durations, and intonation commands predicted by the Bonn Open Synthesis ...

متن کامل

Generating Gestural Scores from Acoustics Through a Sparse Anchor-Based Representation of Speech

We present a procedure for generating gestural scores from speech acoustics. The procedure is based on our recent SABR (sparse, anchor-based representation) algorithm, which models the speech signal as a linear combination of acoustic anchors. We present modifications to SABR that encourage temporal smoothness by restricting the number of anchors that can be active over an analysis window. We p...

متن کامل

Articulatory phonological code for word classification

We propose a framework that leverages articulatory phonology for speech recognition. “Gestural pattern vectors” (GPV) encode the instantaneous gestural activations that exist across all tract variables at each time. Given a speech observation, recognizing the sequence of GPV recovers the ensemble of gestural activations, i.e., the gestural score. For each word in the vocabulary, we use a task d...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

A procedure for estimating gestural scores from natural speech

نویسندگان

چکیده

منابع مشابه

A procedure for estimating gestural scores from speech acoustics.

Estimation of articulatory gesture patterns from speech acoustics

Control concepts for articulatory speech synthesis

Generating Gestural Scores from Acoustics Through a Sparse Anchor-Based Representation of Speech

Articulatory phonological code for word classification

عنوان ژورنال:

اشتراک گذاری